A Data Mining Architecture for Distributed Environments
Abstract
Data mining offers tools for discovering relationships, patterns, and knowledge in massive databases in order to guide decisions about future activities. Applications from various domains have adopted this technique to perform data analysis efficiently. Several issues need to be addressed when such techniques are applied to data that are large in size and geographically distributed across various sites. In this paper we describe a system architecture for a scalable and portable distributed data mining application. The system contains modules for secure distributed communication, database connectivity, organized data management, and efficient data analysis for generating a global mining model. A performance evaluation of the system is also carried out and presented.
Similar resources
WS-DAI-DM: An Interface Specification for Data Mining in Grid Environments
Providing the appropriate access means for data mining services in Grid Environment is principal for combination of Grid and data mining. The transition from centralized data mining process as they are in traditional tools to Grid-compliant and Grid-based data mining services that can coordinate with each other is important to extract useful and potential knowledge/patterns from distributed dat...
GridMiner: An Infrastructure for Data Mining on Computational Grids
Knowledge discovery in datasets integrated into Grids is a challenging research task. These large datasets are being collected and accumulated across a wide variety of fields, at a dramatical pace. They are often heterogeneous and geographically distributed and globally used by large user communities. There are major challenges involved in the efficient and reliable storage, fast processing, in...
The Weka4WS framework for distributed data mining in service-oriented Grids
The service oriented architecture (SOA) paradigm can be exploited for the implementation of data and knowledge-based applications in distributed environments. The Web Services Resource Framework (WSRF) has recently emerged as the standard for the implementation of Grid services and applications. WSRF can be exploited for developing high-level services for distributed data mining applications. T...
A Web Service-based approach for data mining in distributed environments
Data mining is usually associated with centralized data mining systems. Here we present an approach to develop a data mining system in distributed environments. The main difficulty in this approach is the unrestricted sharing of information and dynamic integration of components. In this paper, we present a Web Service-based approach to solve these problems. The system built using this approach ...
A Virtual Organisation deployed on a Service Orientated Architecture for Distributed Data Mining applications
Industrial and scientific research activity increasingly involves the geographically distributed utilisation of multiple tools, services and distributed data. Grid and Service Orientated Architecture concepts are being widely investigated as a means to deploy Virtual Organisations to support the needs for distributed collaboration. A generic Distributed Tool, Service and Data Architecture is de...
Publication date: 2002